PoSE context length ext #1567
Conversation
I'm pretty sure your PR doesn't quite work for chunks > 2.
src/axolotl/utils/trainer.py (outdated)

```python
                i for i, token_id in enumerate(input_ids) if token_id in split_on_token_ids
            ]
        else:
            split_indices = [sample_len // chunks]
```
I don't think this is going to work for any n_chunks > 2, right?
ah, you're right
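For reference, the bug is that `[sample_len // chunks]` only produces the first boundary, so samples are never split more than once. A sketch of a fallback that yields evenly spaced boundaries for any chunk count (`get_split_indices` is a hypothetical helper name, not the PR's final code):

```python
def get_split_indices(sample_len: int, chunks: int) -> list[int]:
    # One boundary between each pair of adjacent chunks: chunks - 1 indices.
    # For chunks == 2 this reduces to [sample_len // 2], matching the
    # original single-split behavior; for chunks > 2 it adds the
    # boundaries the original expression was missing.
    return [sample_len * i // chunks for i in range(1, chunks)]
```

Splitting at these indices gives `chunks` pieces whose lengths differ by at most one token.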
* PoSE wip
* fixes for PoSE splitting
* set PoSE context len so we can pick that up separately from the usable training context len
* support min sample len and define num chunks
* fix chunk splitting
* support for curriculum/ordered learning with PoSE
* fix sequence len sort
* add curriculum_sampling to pydantic
PoSE paper: https://huggingface.co/papers/2309.10400
Model: https://huggingface.co/winglian/Llama-3-8b-64k-PoSE
YAML: https://huggingface.co/winglian/Llama-3-8b-64k-PoSE/blob/main/axolotl/pose.yaml
Adds the PoSE technique for extending context length without needing long-context training data.
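The core idea of PoSE, as described in the paper above, is to train on short sequences while exposing the model to position ids from the full target context window: each sample is split into chunks, and a random skip is added to the position ids of later chunks. A minimal sketch of that position-id manipulation (hypothetical helper, not axolotl's actual implementation):

```python
import random


def pose_position_ids(sample_len: int, context_len: int, chunks: int = 2) -> list[int]:
    # Chunk boundaries over the (short) real sample.
    bounds = [sample_len * i // chunks for i in range(chunks + 1)]
    # Random skip offsets, sorted so position ids stay strictly increasing
    # across chunk boundaries; the largest possible position is
    # sample_len - 1 + (context_len - sample_len) = context_len - 1.
    max_skip = context_len - sample_len
    skips = sorted(random.randint(0, max_skip) for _ in range(chunks))
    position_ids = []
    for c in range(chunks):
        position_ids.extend(p + skips[c] for p in range(bounds[c], bounds[c + 1]))
    return position_ids
```

The model thus sees relative positions spanning up to `context_len` while only ever attending over `sample_len` real tokens, which is what lets PoSE extend the usable context window without long-context data.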